Generation of Correlated Binary Sequence from White Noise 
We suggest a method for the generation of random binary sequences with prescribed
correlation properties. It is based on a modification of the widely used
convolution method of constructing continuous random processes. Apart from the
theoretical interest, this method can be used in various applications such as the
design of one-dimensional devices giving rise to selective transport properties.

PACS numbers: 05.40.-a, 02.50.Ga, 87.10.+e



I. INTRODUCTION 

Generators of random numbers or white-noise signals are customary elements in modern
digital electronics. Different algorithms are used for this purpose. The quality of a generated
white noise is determined by the length of the sequence whose elements can be considered
as uncorrelated. In many areas of physics and engineering, such as signal processing, it
is required to generate a colored noise, i.e. a correlated random process. Since the pair
correlations usually give the principal contribution to the observable quantities, the problem of
generation of a random sequence with the prescribed pair correlator is of particular interest.

It has been known for a long time that continuous colored noise with exponential correlations
is generated by the linear Ornstein-Uhlenbeck process (see the modern review in Ref. [1]).
A more general method, valid for the generation of continuous random sequences with an
arbitrary correlator, is based on the convolution of white noise with the modulation function
defined by the pair correlator. Originally, the convolution method was proposed by Rice.
Applications of some versions of this method for the generation of random sequences with
specific correlations, including the long-range non-exponential correlations, can be found in
Refs.



In the theory of spatially disordered systems the role of the pair correlator (for random 
potentials) is emphasized by the fact that it determines the scattering cross section in the 
Born approximation (for weak potentials). As a result, many linear transport characteristics 
(conductance, transmission and reflection coefficients, localization length, etc.) are expressed 
through the pair correlator.

There are some examples of systems (or processes) with correlated disorder for which
the fluctuating parameter takes discrete values. An example of such a system is the DNA
macromolecule. Here the genetic information is written using four symbols that are the basic
nucleotides. In digital devices the information is transmitted in the form of a telegraphic
signal, i.e. a binary code. The binary sequence is the limiting case of random sequences
with the least number of basic elements. For practical applications it is desirable to
develop a method of generation of a binary sequence with a prescribed pair correlator, that is, a
kind of colored noise containing two elements, e.g., "0" and "1". Although there were some
attempts to obtain a robust algorithm for generation of a correlated binary sequence with
the purpose of increasing the performance of a pulse radar, the problem is still lacking a
general solution. It is worth mentioning that there are methods of generation of a correlated
binary sequence which are not based on the properties of the correlation function, see, e.g.,
Ref. [21]. It is not yet known what constraints (if any) are imposed on the pair
correlator by the fact that the sequence is dichotomous (binary). For example, an
attempt to generate a dichotomous sequence with correlations decaying according to an
inverse power law was unsuccessful.

Recently, we addressed the mathematical problem of generation of a dichotomous sequence
with prescribed correlation properties. We concentrated our attention on statistical
properties of binary additive Markov chains. It was shown that some special classes of
correlations can be reconstructed with the use of the so-called memory function. The latter
is related to the pair correlator through a quite complicated linear integral equation.

In this paper we present a new approach based on the convolution method that was
modified for the generation of correlated binary sequences. The relation between the filtering
function, i.e. the kernel of the convolution operator, and the pair correlator turns out to be
relatively simple. The advantage of this method is that it requires less computational effort
to generate long sequences.



II. CONVOLUTION METHOD 

The convolution method of generation of a continuous colored noise $\beta(n)$ starting from a
white noise $\alpha(n)$ is based on a linear transformation with the use of the modulation function
$G(n)$, see Refs. [6,7,15]. The most general form of this transformation is as follows,

$$\beta(n) = \bar\beta + \sqrt{\frac{C_\beta(0)}{C_\alpha(0)}} \sum_{n'=-\infty}^{\infty} G(n-n')\,[\alpha(n') - \bar\alpha]. \tag{1}$$

Here $\bar\alpha$ and $\bar\beta$ are the mean values, and $C_\alpha(0) = \overline{\alpha^2(n)} - (\bar\alpha)^2$ and
$C_\beta(0) = \overline{\beta^2(n)} - (\bar\beta)^2$ are the variances of the white and colored noise, respectively.
For a homogeneous sequence $\alpha(n)$ the generated sequence $\beta(n)$ is also homogeneous. In what
follows, we introduce the normalized pair correlator $K_\beta(r) = C_\beta(r)/C_\beta(0)$, which is an even
function of $r$. Substituting the linear transformation (1) into the correlation function $C_\beta(r)$,

$$C_\beta(r) = \overline{\beta(n+r)\beta(n)} - (\bar\beta)^2 = C_\beta(0) K_\beta(r), \tag{2}$$

and taking into account that the sequence $\alpha(n)$ is $\delta$-correlated, the following relation between
the pair correlator and the modulation function is readily obtained,

$$K_\beta(r) = \sum_{n=-\infty}^{\infty} G(n)\,G(n+r). \tag{3}$$

From the condition $K_\beta(0) = 1$ one gets

$$\sum_{n=-\infty}^{\infty} G^2(n) = 1. \tag{4}$$

In calculating the product $\overline{\beta(n+r)\beta(n)}$, we took into account that the white noise $\alpha(n)$ is
ergodic, i.e. the average over $n$ can be replaced by the average over the ensemble of white-noise
sequences.
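
The step from Eq. (1) to Eq. (3) can be written out explicitly. The following is a sketch that assumes the white noise obeys $\overline{[\alpha(n')-\bar\alpha][\alpha(n'')-\bar\alpha]} = C_\alpha(0)\,\delta_{n'n''}$; the summation index $m = n - n'$ is introduced here for illustration only.

```latex
\begin{align*}
C_\beta(r) &= \overline{[\beta(n+r)-\bar\beta]\,[\beta(n)-\bar\beta]} \\
 &= \frac{C_\beta(0)}{C_\alpha(0)} \sum_{n',n''} G(n+r-n')\,G(n-n'')\,
    \overline{[\alpha(n')-\bar\alpha]\,[\alpha(n'')-\bar\alpha]} \\
 &= C_\beta(0) \sum_{n'=-\infty}^{\infty} G(n+r-n')\,G(n-n')
  = C_\beta(0) \sum_{m=-\infty}^{\infty} G(m)\,G(m+r).
\end{align*}
```

Dividing by $C_\beta(0)$ reproduces Eq. (3), and setting $r = 0$ gives the normalization (4).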

Since $K_\beta(n)$ and $G(n)$ are even functions, it is convenient to apply the cosine Fourier
transform to both sides of Eq. (3). This results in the following relation,

$$\mathcal{K}_\beta(k) = \mathcal{G}^2(k), \tag{5}$$

where

$$\mathcal{K}_\beta(k) = 1 + 2\sum_{r=1}^{\infty} K_\beta(r)\cos(kr), \qquad K_\beta(r) = \frac{1}{\pi}\int_0^{\pi} \mathcal{K}_\beta(k)\cos(kr)\,dk. \tag{6}$$

Similar relations can be written for $\mathcal{G}(k)$ and $G(n)$.

The expression (5) determines the modulation function in terms of the Fourier transform
of the pair correlator,

$$G(n) = \frac{1}{\pi}\int_0^{\pi} \mathcal{K}_\beta^{1/2}(k)\cos(kn)\,dk. \tag{7}$$

Evidently, the solution (7) satisfies the normalization condition (4).
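
As a numerical illustration of Eqs. (1), (4) and (7), the following Python sketch computes $G(n)$ for an exponential correlator $K_\beta(r) = \exp(-\gamma|r|)$ and filters Gaussian white noise with it. It is a sketch under stated assumptions, not a reference implementation: the function names, the midpoint quadrature, the truncation `n_max` and the value $\gamma = 0.5$ are choices made for this example, and the spectrum $\sinh\gamma/(\cosh\gamma - \cos k)$ is the cosine transform (6) of this correlator.

```python
import numpy as np

def modulation_function(spectrum, n_max, n_k=4096):
    """Approximate G(n), n = -n_max..n_max, from Eq. (7) with a midpoint rule."""
    k = (np.arange(n_k) + 0.5) * np.pi / n_k      # midpoints of n_k cells on (0, pi)
    n = np.arange(-n_max, n_max + 1)
    root = np.sqrt(spectrum(k))                    # K^{1/2}(k); requires spectrum >= 0
    # (1/pi) * integral over (0, pi) equals the mean of the integrand over the grid
    return np.cos(np.outer(n, k)) @ root / n_k

def colored_noise(G, alpha, beta_mean=0.0, beta_var=1.0):
    """Eq. (1), with sample estimates standing in for the mean and variance of alpha."""
    scale = np.sqrt(beta_var / alpha.var())
    return beta_mean + scale * np.convolve(alpha - alpha.mean(), G, mode="same")

def correlator(x, r):
    """Normalized sample pair correlator at lag r >= 0."""
    d = x - x.mean()
    return np.mean(d[r:] * d[:len(d) - r]) / d.var()

gamma = 0.5                                         # assumed decay rate for the example
spectrum = lambda k: np.sinh(gamma) / (np.cosh(gamma) - np.cos(k))
G = modulation_function(spectrum, n_max=60)

rng = np.random.default_rng(0)
beta = colored_noise(G, rng.standard_normal(200_000))

norm = np.sum(G**2)                                 # Eq. (4): should be close to 1
k1 = correlator(beta, 1)                            # should be close to exp(-gamma)
print(norm, k1, np.exp(-gamma))
```

The truncation at `n_max` is harmless here because $G(n)$ decays exponentially for this correlator; for slowly decaying correlators a larger window would be needed.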

For different white-noise sequences $\alpha(n)$ the convolution method, Eq. (1) with the modulation
function (7), defines an ensemble of colored-noise sequences $\beta(n)$ possessing the same pair
correlator $K_\beta(n)$. The number of terms contributing to the series (1) depends on the sharpness
of the correlator $K_\beta(n)$. For short-range correlations, when $K_\beta(n)$ decays very fast, the
modulation function $G(n)$ is also sharp; therefore, the principal contribution is given by a single
term with $n' = n$. The sequence $\beta(n)$ in this case is practically delta-correlated for both
continuous and binary sequences $\alpha(n)$. In the opposite case of long-range correlations, when
the correlation length $R_c$ is large ($R_c \gg 1$), many terms contribute to Eq. (1). In this case,
even for a binary sequence $\alpha(n)$, employing the method of characteristic function, one can
obtain that the probability density $P_B(\beta)$ for the stochastic variable $\beta(n)$ has the Gaussian form,

$$P_B(\beta) = \frac{1}{\sqrt{2\pi C_\beta(0)}} \exp\left[-\frac{(\beta-\bar\beta)^2}{2C_\beta(0)}\right], \tag{8}$$

provided the condition

$$(\beta-\bar\beta)^2/2C_\beta(0) \ll R_c \tag{9}$$

is fulfilled. The deviations from the Gaussian shape may appear only at the far tails, where
$(\beta-\bar\beta)^2/2C_\beta(0) \gg R_c$. Note that for a continuous Gaussian distribution of $\alpha(n)$ Eq. (8) is
exact for any value of $R_c$.

From the above consideration it is clear that the correlated sequence $\beta(n)$ may be generated
using very different uncorrelated sequences $\alpha(n)$, including binary white noise.
However, if we start with binary white noise $\alpha(n)$, the generated sequence $\beta(n)$ is obviously
non-binary, since at any site $n$ the value of $\beta(n)$ in Eq. (1) results from a linear
superposition of binary entries. This means that the direct application of the convolution
method does not generate a binary correlated sequence.
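
The last point is easy to see numerically. The sketch below applies Eq. (1) to a binary white noise of 0's and 1's and counts the distinct values in the output. The five-point modulation function is hypothetical and chosen only so that it satisfies the normalization (4); with $\bar\beta = 0$ and $C_\beta(0) = C_\alpha(0)$ the prefactor in Eq. (1) equals one.

```python
import numpy as np

rng = np.random.default_rng(1)
alpha = rng.integers(0, 2, size=50_000).astype(float)   # binary white noise, mean 1/2

# Hypothetical short modulation function, rescaled so that sum(G**2) == 1 (Eq. (4)).
G = np.array([0.2, 0.4, 0.8, 0.4, 0.2])
G = G / np.sqrt(np.sum(G**2))

# Eq. (1) with beta_mean = 0 and equal variances: convolution of the centred input.
beta = np.convolve(alpha - 0.5, G, mode="same")

n_input_values = np.unique(alpha).size                   # distinct values in alpha
n_output_values = np.unique(np.round(beta, 12)).size     # distinct values in beta
print(n_input_values, n_output_values)
```

The input takes two values, while the output takes many more, one for each distinct weighted sum of the binary entries under the window.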



III. FILTERING PROBABILITY 



Let us now consider the problem of generating a binary sequence $\varepsilon(n)$ with prescribed
correlations, assuming that the sequence $\alpha(n)$ is also binary. We suppose that both
sequences $\varepsilon(n)$ and $\alpha(n)$ consist of 0's and 1's. Let the $n$th site of $\varepsilon(n)$ be associated with the
number $P_n$ ($0 \le P_n \le 1$), which is the probability of obtaining "1" at this site. In order
to calculate the filtering probabilities $P_n$ from the white noise $\alpha(n)$, in analogy with Eq. (1) we
propose the linear transformation

$$P_n = \bar\varepsilon + \sqrt{\frac{C_\varepsilon(0)}{C_\alpha(0)}} \sum_{n'=-\infty}^{\infty} F(n-n')\,[\alpha(n') - \bar\alpha]. \tag{10}$$

Here $C_\varepsilon(0) = \bar\varepsilon(1-\bar\varepsilon)$ and $C_\alpha(0) = \bar\alpha(1-\bar\alpha)$ are the variances of $\varepsilon(n)$ and $\alpha(n)$, respectively,
and $F(n)$ is an unknown modulation function to be determined. Given the value of $P_n$,
the $n$th symbol is generated by drawing a random number from the interval [0,1]. If this
number is less than $P_n$, then $\varepsilon(n) = 1$; otherwise $\varepsilon(n) = 0$. Thus, a binary sequence $\varepsilon(n)$
can be generated once the set of numbers $\{P_n\}_{n=-\infty}^{\infty}$ is known from Eq. (10). The values
of $P_n$ are correlated in the following way,

$$\overline{[P_{n+r}-\bar\varepsilon]\,[P_n-\bar\varepsilon]} = C_\varepsilon(0) \sum_{n=-\infty}^{\infty} F(n)\,F(n+r), \quad \text{for } r \neq 0. \tag{11}$$
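
The drawing rule described above (compare a uniform random number with $P_n$) can be sketched in a few lines of Python. The helper name `draw_binary` and the constant test probabilities are hypothetical and serve only to check the rule; in the actual method the $P_n$ come from Eq. (10).

```python
import numpy as np

def draw_binary(P, rng):
    """Set eps(n) = 1 where a uniform number from [0, 1) falls below P_n, else 0."""
    P = np.asarray(P, dtype=float)
    if np.any(P < 0.0) or np.any(P > 1.0):
        raise ValueError("filtering probabilities must lie in [0, 1]")
    return (rng.random(P.shape) < P).astype(np.int8)

rng = np.random.default_rng(2)
P = np.full(100_000, 0.3)            # hypothetical constant probabilities, for the check only
eps = draw_binary(P, rng)
frequency_of_ones = eps.mean()       # should be close to 0.3
print(frequency_of_ones)
```

Each site is drawn independently given its own $P_n$, which is the property used in the argument leading to Eq. (12).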

According to the method of generation of the sequence $\varepsilon(n)$ from the filtering probabilities,
the probability of obtaining the symbol $\varepsilon(n+r)$ at the $(n+r)$th site does not depend
on the emergence of the symbol $\varepsilon(n)$ at the $n$th site (for $r \neq 0$). Therefore, the product $P_{n+r}P_n$
gives the joint probability of obtaining 1 at the $n$th and $(n+r)$th sites. If 0 appears at
either of these sites, the corresponding pair does not contribute to the product $\varepsilon(n+r)\varepsilon(n)$.
Hence, the correlation function for the sequence $\varepsilon(n)$ can be expressed through the correlation
function of the filtering probabilities,

$$\overline{[\varepsilon(n+r)-\bar\varepsilon]\,[\varepsilon(n)-\bar\varepsilon]} = \lim_{N\to\infty} \frac{1}{2N+1} \sum_{n=-N}^{N} P_{n+r}P_n - \bar\varepsilon^2 = \overline{[P_{n+r}-\bar\varepsilon]\,[P_n-\bar\varepsilon]}. \tag{12}$$

As one can see, the correlations in the sequence $\varepsilon(n)$ occur because of the correlations
between the filtering probabilities. The latter are enforced by the modulation function, see
Eq. (10). Thus, using Eqs. (11) and (12), the relation between the correlator of the binary
sequence and the modulation function can be written as

$$K_\varepsilon(r) = \sum_{n=-\infty}^{\infty} F(n)\,F(n+r) \quad \text{for } r \neq 0; \tag{13}$$

$$K_\varepsilon(0) = 1. \tag{14}$$



The normalization condition (14) is a property of the correlator $K_\varepsilon(r)$. It should be
stressed that, unlike Eq. (3), which is valid for all values of $r$ including $r = 0$, in the case of
binary sequences the derived Eq. (13) is not valid for $r = 0$. Therefore, the sum $\sum_n F^2(n)$
remains undefined and has to be considered as a free constant,

$$A = \sum_{n=-\infty}^{\infty} F^2(n) = \frac{1}{\pi}\int_0^{\pi} \mathcal{F}^2(k)\,dk. \tag{15}$$

This constant now appears in the Fourier transform of Eq. (13) as follows,

$$\mathcal{K}_\varepsilon(k) = 1 - A + \mathcal{F}^2(k). \tag{16}$$
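
Equation (16) can be checked by squaring the Fourier series of $F(n)$. The short derivation below is a sketch that assumes the transform $\mathcal{F}(k) = \sum_n F(n)\,e^{ikn}$, which for even $F(n)$ coincides with the cosine series used in Eq. (6).

```latex
\begin{align*}
\mathcal{F}^2(k) &= \sum_{r=-\infty}^{\infty} e^{ikr} \sum_{n=-\infty}^{\infty} F(n)\,F(n+r) \\
 &= \sum_{n=-\infty}^{\infty} F^2(n)
    + 2\sum_{r=1}^{\infty} \cos(kr) \sum_{n=-\infty}^{\infty} F(n)\,F(n+r) \\
 &= A + 2\sum_{r=1}^{\infty} K_\varepsilon(r)\cos(kr)
  = A + \mathcal{K}_\varepsilon(k) - 1 .
\end{align*}
```

The $r = 0$ term is the only place where Eq. (13) cannot be used, which is why $A$ enters as a separate constant.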

Using Eq. (16), the relation between the modulation function and the pair correlator of the
binary sequence can be written in the form

$$F(n) = \frac{1}{\pi}\int_0^{\pi} \left[\mathcal{K}_\varepsilon(k) - 1 + A\right]^{1/2} \cos(kn)\,dk. \tag{17}$$

Thus, Eqs. (10) and (17) define the algorithm of the generation of a binary sequence with
the prescribed pair correlator.
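
Equation (17) is straightforward to evaluate numerically. The Python sketch below does so with a midpoint rule and then checks the result against Eqs. (13) and (15). The function names, the grid size `n_k`, the truncation `n_max`, and the test case (exponential correlator with $\gamma = 2$ and $A = 1 - \tanh(\gamma/2)$) are assumptions made for this example.

```python
import numpy as np

def filtering_function(spectrum, A, n_max, n_k=8192):
    """Approximate F(n), n = -n_max..n_max, from Eq. (17) with a midpoint rule."""
    k = (np.arange(n_k) + 0.5) * np.pi / n_k       # midpoints of n_k cells on (0, pi)
    arg = spectrum(k) - 1.0 + A                     # argument of the square root in Eq. (17)
    if np.any(arg < -1e-12):
        raise ValueError("K(k) - 1 + A is negative for some k: A is too small")
    n = np.arange(-n_max, n_max + 1)
    root = np.sqrt(np.clip(arg, 0.0, None))         # clip only removes rounding noise
    return np.cos(np.outer(n, k)) @ root / n_k

def lag_sum(F, r):
    """Truncated version of sum_n F(n) F(n + r)."""
    return np.sum(F[r:] * F[:len(F) - r])

gamma = 2.0                                          # assumed decay rate for the example
spectrum = lambda k: np.sinh(gamma) / (np.cosh(gamma) - np.cos(k))
A = 1.0 - np.tanh(gamma / 2)

F = filtering_function(spectrum, A, n_max=200)
sum_sq = lag_sum(F, 0)                               # Eq. (15): should reproduce A
k1 = lag_sum(F, 1)                                   # Eq. (13): should reproduce exp(-gamma)
print(sum_sq, A, k1, np.exp(-gamma))
```

Passing a value of $A$ that makes the square-root argument negative raises an error, which mirrors the positivity requirement on Eq. (17).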

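The unknown constant $A$ has to satisfy the condition that the values of the filtering
probabilities (10) lie between 0 and 1,

$$\sum_{n=-\infty}^{\infty} |F(n)| \le \frac{\min(\bar\varepsilon, 1-\bar\varepsilon)}{\sqrt{\bar\varepsilon(1-\bar\varepsilon)}}\, \frac{\sqrt{\bar\alpha(1-\bar\alpha)}}{\max(\bar\alpha, 1-\bar\alpha)} \le 1. \tag{18}$$

Taking into account that the argument of the square root in Eq. (17) must be non-negative, this
inequality can be rewritten (for $\bar\alpha = \bar\varepsilon = 1/2$) in terms of the constant $A$,

$$1 - \mathcal{K}_\varepsilon(k) \le A = \sum_{n=-\infty}^{\infty} F^2(n) \le \sum_{n=-\infty}^{\infty} |F(n)| \le 1. \tag{19}$$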
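
The bound (18) can be traced back to Eq. (10) by a worst-case estimate. The sketch below assumes only that each $\alpha(n')$ equals 0 or 1, so that $|\alpha(n') - \bar\alpha| \le \max(\bar\alpha, 1-\bar\alpha)$.

```latex
\begin{align*}
|P_n - \bar\varepsilon|
 &\le \sqrt{\frac{\bar\varepsilon(1-\bar\varepsilon)}{\bar\alpha(1-\bar\alpha)}}\;
      \max(\bar\alpha, 1-\bar\alpha) \sum_{n=-\infty}^{\infty} |F(n)| , \\
0 \le P_n \le 1 \ \text{is guaranteed if}\quad
\sum_{n=-\infty}^{\infty} |F(n)|
 &\le \frac{\min(\bar\varepsilon, 1-\bar\varepsilon)}{\sqrt{\bar\varepsilon(1-\bar\varepsilon)}}\;
      \frac{\sqrt{\bar\alpha(1-\bar\alpha)}}{\max(\bar\alpha, 1-\bar\alpha)} .
\end{align*}
```

For $\bar\alpha = \bar\varepsilon = 1/2$ both ratios on the right-hand side equal 1. Moreover, $\sum_n |F(n)| \le 1$ implies $|F(n)| \le 1$ for every $n$, hence $F^2(n) \le |F(n)|$, which is the middle inequality in Eq. (19).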

The constraints imposed by Eqs. (18) and (19) limit the class of binary sequences with a
prescribed pair correlator that can be generated. In particular, a binary sequence with the slowly decaying
correlator $K_\varepsilon(n) = \sin(an)/an$ cannot be generated with the use of the proposed algorithm,
since the sum $\sum_n |F(n)|$ diverges in this case. As is known, power-law decaying correlators
give rise to a kind of mobility edge in systems with random one-dimensional
potentials. In this case, a sharp transition from localized to delocalized eigenstates occurs
when crossing some value in the energy spectrum, specified by the mobility edge. Although
such mobility edges were experimentally observed for a continuous distribution of fluctuations
in artificially fabricated site-potentials with long-range correlations, it is not clear
yet whether they exist for a correlated binary sequence. That is why the existence of a mobility
edge in a sequence of nucleotides in a DNA molecule is still questionable.

As an example, let us consider the exponential binary correlator with the corresponding
Fourier representation,

$$K_\varepsilon(r) = \exp(-\gamma|r|), \qquad \mathcal{K}_\varepsilon(k) = \frac{\sinh\gamma}{\cosh\gamma - \cos k}. \tag{20}$$

Since $\mathcal{K}_\varepsilon(k)$ reaches its minimum at $k = \pi$, the condition $1 - \mathcal{K}_\varepsilon(k) \le A$ is satisfied for
$A = 1 - \mathcal{K}_\varepsilon(\pi) = 1 - \tanh(\gamma/2)$. Therefore, we have

$$\mathcal{F}^2(k) = \frac{1 + \cos k}{\cosh\gamma - \cos k}\,\tanh\frac{\gamma}{2}, \tag{21}$$

and the function $F(n)$ reads

$$F(n) = \frac{1}{\pi}\sqrt{\tanh\frac{\gamma}{2}} \int_0^{\pi} \cos(kn)\,\sqrt{\frac{1 + \cos k}{\cosh\gamma - \cos k}}\;dk. \tag{22}$$
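
The algebra behind Eq. (21) is short. Substituting Eq. (20) and $A = 1 - \tanh(\gamma/2)$ into Eq. (16), and using the identity $\sinh\gamma = \tanh(\gamma/2)\,(\cosh\gamma + 1)$, one can write the following.

```latex
\begin{align*}
\mathcal{F}^2(k) &= \mathcal{K}_\varepsilon(k) - 1 + A
  = \frac{\sinh\gamma}{\cosh\gamma - \cos k} - \tanh\frac{\gamma}{2} \\
 &= \tanh\frac{\gamma}{2}\left[\frac{\cosh\gamma + 1}{\cosh\gamma - \cos k} - 1\right]
  = \tanh\frac{\gamma}{2}\;\frac{1 + \cos k}{\cosh\gamma - \cos k} .
\end{align*}
```

The result is non-negative for all $k$ and vanishes at $k = \pi$, as expected for the smallest admissible value of $A$.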



For $n \gg 1$ the modulation function decays as follows,

$$F(n) \approx \frac{(-1)^{n+1}}{2\pi n^2 \cosh(\gamma/2)}\,\sqrt{\tanh\frac{\gamma}{2}} \propto 1/n^2, \tag{23}$$

therefore, the sum $\sum_n |F(n)|$ converges. It, however, exceeds 1 for $\gamma < \gamma_{cr} \approx 1.60$, thus
violating the last inequality in Eq. (19). The numerical simulation shows that for $\gamma > \gamma_{cr}$
the method works quite well, making it possible to construct binary chains of length $10^7$
with the prescribed correlator $K(r)$ to within a few percent accuracy for $r < 5$.
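
The threshold in $\gamma$ can be explored numerically by evaluating the sum of $|F(n)|$ for several decay rates. The sketch below uses Eqs. (21) and (22) with a midpoint rule. The function name `abs_sum`, the truncation `n_max` and the grid size `n_k` are assumptions of this example, and the truncated sum slightly underestimates the infinite one.

```python
import numpy as np

def abs_sum(gamma, n_max=300, n_k=4096):
    """Truncated sum of |F(n)| over |n| <= n_max for the exponential correlator."""
    k = (np.arange(n_k) + 0.5) * np.pi / n_k        # midpoints of n_k cells on (0, pi)
    # Square root of Eq. (21)
    F_hat = np.sqrt(np.tanh(gamma / 2) * (1 + np.cos(k)) / (np.cosh(gamma) - np.cos(k)))
    n = np.arange(0, n_max + 1)
    F = np.cos(np.outer(n, k)) @ F_hat / n_k        # Eq. (22) for n >= 0
    return abs(F[0]) + 2.0 * np.sum(np.abs(F[1:]))  # F(n) is even in n

for gamma in (1.0, 1.4, 1.6, 1.8, 2.5):
    print(f"gamma = {gamma:.1f}   sum |F(n)| ~ {abs_sum(gamma):.3f}")
```

Values of $\gamma$ for which the printed sum exceeds 1 violate the last inequality in Eq. (19), so the filtering probabilities are no longer guaranteed to stay within [0,1].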

In conclusion, we suggest a method of filtering probabilities to construct a binary correlated
sequence from white noise. The proposed algorithm consists of the following steps.
First, starting from the prescribed mean value $\bar\varepsilon$, variance $C_\varepsilon(0)$ and power spectrum $\mathcal{K}_\varepsilon(k)$,
one calculates the filtering function $F(n)$ by making use of Eq. (17). The next step is the
optimization of the value of the constant $A$ according to Eq. (19). Then, the filtering
probabilities $P_n$ are calculated from Eq. (10). Finally, for each site $n$, by comparing the value of
$P_n$ with a number drawn randomly from the interval [0,1], one gets the number 0 or 1 that
forms the binary sequence $\varepsilon(n)$.
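
The steps listed above can be sketched end to end in Python for the exponential correlator with $\bar\alpha = \bar\varepsilon = 1/2$, for which the prefactor in Eq. (10) equals one. This is a minimal illustration under stated assumptions rather than the authors' code: the function names, the truncation `n_max`, the sequence length and the value $\gamma = 2$ are choices made for the example, and $A = 1 - \tanh(\gamma/2)$ is taken from the worked example instead of being optimized.

```python
import numpy as np

def filtering_function(gamma, n_max, n_k=4096):
    """F(n), n = -n_max..n_max, from Eqs. (21)-(22) with a midpoint rule."""
    k = (np.arange(n_k) + 0.5) * np.pi / n_k
    F_hat = np.sqrt(np.tanh(gamma / 2) * (1 + np.cos(k)) / (np.cosh(gamma) - np.cos(k)))
    n = np.arange(-n_max, n_max + 1)
    return np.cos(np.outer(n, k)) @ F_hat / n_k

def generate_binary(gamma, length, rng, n_max=100):
    """Binary sequence of 0's and 1's with target correlator exp(-gamma * |r|)."""
    F = filtering_function(gamma, n_max)
    if np.sum(np.abs(F)) > 1.0:                      # last inequality in Eq. (19)
        raise ValueError("sum |F(n)| > 1: probabilities may leave [0, 1]")
    alpha = rng.integers(0, 2, size=length + 2 * n_max).astype(float)  # binary white noise
    P = 0.5 + np.convolve(alpha - 0.5, F, mode="valid")                # Eq. (10)
    return (rng.random(P.size) < P).astype(np.int8)                    # drawing rule

def correlator(x, r):
    """Normalized sample pair correlator at lag r >= 0."""
    d = x - x.mean()
    return np.mean(d[r:] * d[:len(d) - r]) / d.var()

gamma, length = 2.0, 400_000
rng = np.random.default_rng(3)
eps = generate_binary(gamma, length, rng)
k1, k2 = correlator(eps, 1), correlator(eps, 2)
print(k1, np.exp(-gamma), k2, np.exp(-2 * gamma))
```

For decay rates below the critical value the function raises an error instead of silently clipping the probabilities, since clipping would distort the correlator.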